Towards Constructing Sports News from Live Text Commentary
نویسندگان
چکیده
In this paper, we investigate the possibility to automatically generate sports news from live text commentary scripts. As a preliminary study, we treat this task as a special kind of document summarization based on sentence extraction. We formulate the task in a supervised learning to rank framework, utilizing both traditional sentence features for generic document summarization and novelly designed task-specific features. To tackle the problem of local redundancy, we also propose a probabilistic sentence selection algorithm. Experiments on our collected data from football live commentary scripts and corresponding sports news demonstrate the feasibility of this task. Evaluation results show that our methods are indeed appropriate for this task, outperforming several baseline methods in different aspects.
منابع مشابه
Content Selection for Real-time Sports News Construction from Commentary Texts
We study the task of constructing sports news report automatically from live commentary and focus on content selection. Rather than receiving every piece of text of a sports match before news construction, as in previous related work, we novelly verify the feasibility of a more challenging setting to generate news report on the fly by treating live text input as a stream. We design scoring func...
متن کاملSports News Generation from Live Webcast Scripts Based on Rules and Templates
With the dramatic increase of the live webcast scripts about sports, it is an urgent demand to write and publish a sports news article immediately after a sports game. However, so far, the sports news articles are usually written by human experts or journalists, and the manual writing of sports news is timeconsuming and inefficient. This paper describes our system on the sports news generation ...
متن کامل39. Opinion mining and sentiment analysis
Opinions are ubiquitous in text, and readers of on-line text — from consumers to sports fans to news addicts to governments — can benefit from automatic methods that synthesise useful opinion-orientated information from the sea of data. In this chapter on opinion mining and sentiment analysis, we introduce an idealised, end-to-end opinion analysis system and describe its components, including c...
متن کاملNUS at WMT09: Domain Adaptation Experiments for English-Spanish Machine Translation of News Commentary Text
We describe the system developed by the team of the National University of Singapore for English to Spanish machine translation of News Commentary text for the WMT09 Shared Translation Task. Our approach is based on domain adaptation, combining a small in-domain News Commentary bi-text and a large out-of-domain one from the Europarl corpus, from which we built and combined two separate phrase t...
متن کاملOverview of the NLPCC-ICCPOL 2016 Shared Task: Sports News Generation from Live Webcast Scripts
Live webcast scripts are valuable resources for describing the process of sports games. This shared task aims to automatically generate sports news articles from live webcast scripts. The task can be considered a special case of single document summarization. In this overview paper, we will introduce the task, the evaluation dataset, the participating teams and the evaluation results. The datas...
متن کامل